Distributional Measures as Proxies for Semantic Relatedness (arXiv:1203.1889v1 [cs.CL], 8 Mar 2012)

Author

  • Graeme Hirst
Abstract

Automatically ranking word pairs by semantic relatedness, in a way that mimics human notions of relatedness, has widespread applications. Both measures that rely on raw data (distributional measures) and measures that use knowledge-rich ontologies exist. Although extensive studies have compared ontological measures with human judgment, distributional measures have primarily been evaluated by indirect means. This paper is a detailed study of some of the major distributional measures; it lists their respective merits and limitations. It suggests new measures that overcome these drawbacks and are more in line with human notions of semantic relatedness. The paper concludes with an exhaustive comparison of the distributional and ontology-based measures. Along the way, significant research problems are identified; work on these problems may lead to a better understanding of how semantic relatedness is to be measured.


Similar resources

Counting Is Easy (arXiv:cs/0110038v1 [cs.CC], 18 Oct 2001)

For any fixed k, a remarkably simple single-tape Turing machine can simulate k independent counters in real time.


AD in Fortran Part 2: Implementation via Prepreprocessor (arXiv:1203.1450v2 [cs.PL], 8 Mar 2012)

We describe an implementation of the FARFEL FORTRAN AD extensions (Radul et al., 2012). These extensions integrate forward and reverse AD directly into the programming model, with attendant benefits to flexibility, modularity, and ease of use. The implementation we describe is a “prepreprocessor” that generates input to existing FORTRAN-based AD tools. In essence, blocks of code which are targe...


Partly Divisible Probability Distributions (arXiv:math/0501183v1 [math.PR], 12 Jan 2005)

Given a probability distribution µ, a set Λ(µ) of positive real numbers is introduced, so that Λ(µ) measures the "divisibility" of µ. The basic properties of Λ(µ) are described, and examples of probability distributions are given that exhibit a continuum of situations interpolating between the extreme cases of infinitely and minimally divisible probability distributions.


Emotional Analysis of Blogs and Forums Data (arXiv:1108.5974v1 [cs.CL], 30 Aug 2011)

Paweł Weroński, Julian Sienkiewicz, Georgios Paltoglou, Kevan Buckley, Mike Thelwall and Janusz A. Hołyst. Faculty of Physics, Center of Excellence for Complex Systems Research, Warsaw University of Technology, Koszykowa 75, PL-00-662 Warsaw, Poland. Statistical Cybermetrics Research Group, School of Technology, University of Wolverhampton, Wulfruna Street, WV1 1LY Wolverhampton, United Kingdom.


Having Fun with Lambert W(x) Function

This short note presents the Lambert W(x) function and its possible application in the framework of physics related to the Pierre Auger Observatory. The actual numerical implementation in C++ consists of Halley’s and Fritsch’s iteration with branch-point expansion, asymptotic series and rational fits as initial approximations. (arXiv:1003.1628v1 [cs.MS], 8 Mar 2010)



Publication date: 2005